Analytical Biases Associated with GC-Content in Molecular Evolution
نویسندگان
چکیده
Molecular evolution is being revolutionized by high-throughput sequencing allowing an increased amount of genome-wide data available for multiple species. While base composition summarized by GC-content is one of the first metrics measured in genomes, its genomic distribution is a frequently neglected feature in downstream analyses based on DNA sequence comparisons. Here, we show how base composition heterogeneity among loci and taxa can bias common molecular evolution analyses such as phylogenetic tree reconstruction, detection of natural selection and estimation of codon usage. We then discuss the biological, technical and methodological causes of these GC-associated biases and suggest approaches to overcome them.
منابع مشابه
Mutational Biases and GC-Biased Gene Conversion Affect GC Content in the Plastomes of Dendrobium Genus
The variation of GC content is a key genome feature because it is associated with fundamental elements of genome organization. However, the reason for this variation is still an open question. Different kinds of hypotheses have been proposed to explain the variation of GC content during genome evolution. However, these hypotheses have not been explicitly investigated in whole plastome sequences...
متن کاملEvidence for a high ancestral GC content in Drosophila.
Study of the nucleotide composition in Drosophila, focusing on the saltans and willistoni groups, has revealed unanticipated differences in nucleotide composition among lineages. Compositional differences are associated with an accelerated rate of nucleotide substitution in functionally less constrained regions. These observations have been set forth against the extended opinion that the patter...
متن کاملThe consequences of base pair composition biases for regulatory network organization in prokaryotes.
Given the dramatic variation in guanine-cytosine (GC) content observed in prokaryotes, from approximately 20% to approximately 75% GC, one wonders if these extreme biases in base pair composition affect the evolution of transcription factor-binding sites (BS). This letter shows that, along the wide range of GC content variation in bacteria, bacterial BS keep a high frequency of AT bases, roughl...
متن کاملIsochores, GC3 and mutation biases in the human genome.
In this work we re-examined the hypothesis that the variation in GC content in the human genome is due to different regional mutational biases. For this purpose we inferred the mutational pattern by using mutation databases that are available for many genes associated with human genetic diseases. The assumption of this approach is that such mutations reflect the actual frequency distribution of...
متن کاملCan mutation or fixation biases explain the allele frequency distribution of human single nucleotide polymorphisms (SNPs)?
One of the most abiding controversies in evolutionary biology concerns the role of neutral processes in molecular evolution. A main focus of the debate has been the evolution of isochores, the strong and systematic variation of base composition in mammalian genomes. One set of hypotheses argue that regions of similar GC are owing to localised mutational biases coupled with neutral evolution. Th...
متن کامل